Winner-Take-All EM Clustering

نویسندگان

  • Vasileios Kandylas
  • Lyle H. Ungar
چکیده

The EM algorithm is often used with mixture models to cluster data, but for efficiency reasons it is sometimes desirable to produce hard clusters. Several hard clustering limits of EM are known. For example, k-means clustering can be derived from EM in a Gaussian mixture model by taking the limit of all variances going to zero. We present a new method of deriving Winner-Take-All versions of EM that can be used for mixtures, such as heteroscedastic Gaussians, where it is not possible to take that limit. The resulting clusters can have non-convex boundaries, allowing for some of the clusters to reside “inside” others, producing dense foreground clusters embedded in a more diffuse background. Experiments show that using unequal variances can give better clusters on real data sets in terms of external quality measures.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Experimental Comparison of Several Clustering and Initialization Methods

We examine methods for clustering in high dimensions. In the first part of the paper, we perform an experimental comparison between three batch clustering algorithms: the Expectation–Maximization (EM) algorithm, a “winner take all” version of the EM algorithm reminiscent of the K-means algorithm, and model-based hierarchical agglomerative clustering. We learn naive-Bayes models with a hidden ro...

متن کامل

SAR imagery segmentation using probabilistic winner-take-all clustering

This paper applies a recently-developed neural clustering scheme, called "probabilistic winner-take-all (PWTA)", to image segmentation. Experimental results are presented. These results show that the PWTA clustering scheme signiicantly outperforms the popular k-means algorithm when both are utilized to segment a synthetic-aperture-radar (SAR) image representing ship targets in an open-ocean scene.

متن کامل

Classifying with Gaussian Mixtures and Clusters

In this paper, we derive classifiers which are winner-take-all (WTA) approximations to a Bayes classifier with Gaussian mixtures for class conditional densities. The derived classifiers include clustering based algorithms like LVQ and k-Means. We propose a constrained rank Gaussian mixtures model and derive a WTA algorithm for it. Our experiments with two speech classification tasks indicate th...

متن کامل

SAR image processing using probabilistic winner-take-all learning and artificial neural networks

This paper develops a two-stage approach for the identii-cation of ship targets in airborne synthetic aperture radar (SAR) imagery representing open ocean scenes. The rst stage of the developed approach segments the SAR image using a novel neural clustering scheme, called "probabilis-tic winner-take-all (PWTA)." As for the second stage, it employs a backpropagation (BP) neural network to classi...

متن کامل

Comparing Clustering Algorithms for the Identification of Similar Pages in Web Applications

In this paper, we analyze some widely employed clustering algorithms to identify duplicated or cloned pages in web applications. Indeed, we consider an agglomerative hierarchical clustering algorithm, a divisive clustering algorithm, k-means partitional clustering algorithm, and a partitional competitive clustering algorithm, namely Winner Takes All (WTA). All the clustering algorithms take as ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009